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August 23, 2012 

Abstract Shape-based regularization has proven to be a 
useful method for delineating objects from the noisy im- 
ages encountered in many applications when one has prior 
knowledge of the shape of the targeted object. When a col- 
lection of possible shapes is available, the specification of 
a shape prior using kernel density estimation is a natural 
technique. Unfortunately, energy functionals arising from 
kernel density estimation are of a form that makes them 
impossible to directly minimize using graph cuts. Here, we 
show how one may recast the energy minization problem 
into a form that is minimizable iteratively using graph cuts. 

Keywords Image segmentation • MM • graph cuts • 
energy minimization • statistical shape prior • kernel 
density estimation 



1 Introduction 

Graph cuts provides an ingenious technique for image seg- 
mentation that relies on transforming the problem of en- 
ergy minimization into the problem of determining the 
maximum flow or minimum cut on an edge-weighted graph. 
By using graph cuts, segmentations can be found efficiently 
in low-order polynomial time. When images are noisy, seg- 
mentation performed using only image data produces poor 
results. In these cases, regularization is needed. 
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Typically, when one is analyzing images, one has ex- 
pectations for what he or she is looking for - people, birds, 
cars, cows, bacteria, whatever the object may be. In these 
cases it is natural to try to restrict segmentation to results 
that match the shape of the desired object. Within any class 
of objects there is often variability in shape. Such variabil- 
ity can arise even when dealing with shape-invariant ob- 
jects, for example by changes in pose or perspective. Ex- 
pression of shape knowledge probabilistically is a natural 
way of capturing this variability. 



1.1 Related prior work 

In Chang et al [4], we presented a technique for tracking 
an object whose boundary motion is modeled using the 
level-set method. Such objects do not retain any particu- 
lar a-priori shape. We remarked that one could adapt our 
regularization method to the segmentation of objects that 
do retain their shapes. The purpose of this article is to 
explicitly present a method for performing such a task. 

Our method is grounded in the theory of graph cuts- 
based image segmentation with shape-based regulariza- 
tion, where segmentation is performed using a-priori shape 
knowledge. The prior literature on this subject is vast. 
Briefly, we mention a few influential articles. One of the 
first successful forays into incorporating shape-based regu- 
larization into graph cuts was the work Slabaugh and Unal 
[22] who developed a method for segmentation assuming 
that the desired object is an ellipse. Allowing for more 
flexibility, Freedman and Zhang 12] provided a method to 
segment any particular shape based on the signed distance 
embedding. Some methods have concentrated on looking 
at shapes with certain characteristics such as compacted- 
ness or local convexity [24 9]. Other methods have treated 
the task of space specification as a statistical problem. 
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Statistical shape priors have taken many forms in the 
literature [18] (8J [27]. Two of the most common forms of 
shape representation are kernel-PCA and kernel-based level- 
set embeddings. Both methods learn a probability distribu- 
tion for the target shape based on a set of training shapes 
or templates. They differ in how they use the training im- 
ages. Kernel PCA-based methods project the training tem- 
plates into spectral space and define probability distribu- 
tion over features. Level-set-based methods define a dis- 
tance between implicitly embedded shapes and define a 
probability distribution with respect to the distance. 

One major disadvantage of the PCA-based approach 
is that the resulting probability distribution for the shape 
is Gaussian. Given a set of training templates, this ap- 
proach will tend to favor shapes that look closer to the 
space averaged-shape, thereby potentially making certain 
training shapes improbable. This problem exists as well 
for exponential-family shape priors where the log-prior is 
a linear combination of the training templates. Combina- 
tions of shapes may not be valid shapes [7], and under this 
approach certain training templates may become improb- 
able as well. Instead, we opt for a kernel-density based 
approach. 

Our method here, and that given in Chang et al [4], can 
be described as a graph-cuts adaptation of the method 
of Cremers et al [6]. In their paper, they treated shape- 
prior specification as a density estimation problem and 
used kernel density estimation (KDE) to derive a prior 
from an ensemble of training shapes. The resulting energy 
functionals, being nonlinear, require some intervention to 
use in a graph-cuts framework. 

In this manuscript, we make the main contribution of 
demonstrating the use of majorization-minimization (MM) 
for the iterative relaxation of a nonlinear energy functional 
using graph-cuts. The MM algorithm is a generalization of 
the well-known expectation-maximization (EM) algorithm. 
It is often useful for linearizing nonlinear objective func- 
tions. We use MM to find a surrogate minimization prob- 
lem that is solved using graph-cuts. The use of graph cuts 
results in a computationally more efficient approach rela- 
tive to that of using level set approaches [15] . 

2 Mathematical Method 

2.1 Shapes 

Signed-distance functions provide a handy tool to repre- 
sent shapes mathematically. Shapes or regions Q C R d can 
be implicitly represented by a function (j>o{s) : M. d — > R, 
which for every s £ M. d is the signed Euclidean distance 
from s to the boundary of fl. In this paper we take the 
convention that 4>q (s) is positive if s € fl, and negative if 

s <= R d \ n. 




Fig. 1 Embedding of shapes using signed Euclidean distance 
functions. A shape (pictured is a leaf) is embedded as a signed 
Euclidean distance function where values inside are the positive dis- 
tance from the boundary, and values outside are negative distances 
from a boundary. The boundary of the leaf is the 0-level set of the 
function (red outline). 



In many applications, since the shape of an object is 
only approximately known, it is advantageous to represent 
the knowledge of the shape probabilistically. We use a ker- 
nel density estimate of the distribution of possible shapes 
embedded as a collection of individual discrete level sets. 
Let us first denote Xo the characteristic function for a re- 
gion J?, 

fi, sen-, 

Xn s = (1) 
[0, s^J?. 

Then, for two shapes fi and A, embedded as signed dis- 
tance functions (j)Q and 4>j\, we introduce a pairwise shape 
energy 

mass mismatch 
, s 

t/sha pe (^) = / (xn(s)-XA(s'))\Ms'tds 

boundary mismatch 
, "> * 

+ / \Mz')\ X dz. (2) 
JdQ 

This expression is a generalization of other shape energies 
seen in the literature, where commonly A = 0,1,2. To 
make this energy robust to changes in scale, orientation, 
and location, a transformation s' = T(s) that maps points 
s in the reference frame of fi to points s' in the reference 
frame of A is needed. In Appendix|A] we provide a method 
for finding rigid transformations of the form 

T(s) = aR(w) (s- c) 
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where a is a scaling factor, R(w) is a rotation matrix, 
and c is a translation vector. These three parameters are 
chosen to minimize Eq [2] (See Appendix [A] for details on 
calculating these parameters). This method worked well 
for our examples; however, other shape alignment schemes 
may be used [161 US [26l EHl H] . 

After transformation, we can use the energy in Eq [2] 
to assemble references shapes {flj}j = i into a probability 
distribution by using a kernel of the form 



Ap-/3tW(«:A) 

2tt 



(3) 



We can then represent a probability distribution over shapes, 
p(fl) as follows: 



(4) 



where {wj}j =1 , are weighting coefficients. This represen- 
tation of the prior is a kernel density estimate [21] (KDE) 
of the distribution of shapes. We set f3 to be a multiple of 
the following value: 



T 2 = 



y Wj min d(f2k, flj 

3 = 1 



The parameter (3 can be considered a reciprocal "temper- 
ature," or belief in the shape model implied by the given 
templates. If /3 = t 2 , then the given templates give a good 
description of the possible shape space. In most applica- 
tions, however, the number of templates available will be 
small and will not provide a complete picture of shape 
space. When the number of provided templates is small, 
r 2 can in practice become rather small, leading to weak 
regularization. Setting j3 to a high multiple of r 2 allows 
one to "boost" his or her belief in the available templates 
as a predictor of the segmented shape. Note that this ex- 
pression of the shape prior is potentially multi-modal. 



2.2 Generative image intensity model 

The task is to identify an object fl in an image I : S (Z 
M. d — > K 9 , where q refers the the dimension of the color 
space used. We model an image probabilistically with dis- 
tributions of intensity values conditional on a parameter 
vector 9, and region: 

T , , \pn{I\0) se fl (foreground); 
J(s) - < 6 
[pa(I\0) s e A = S \ Q (background). 

Information about 6 and fl can be incorporated as prior 
probability distributions p{6\fl) andp(i7). With these prior 



distributions, we can write the joint posterior distribution 



p(n,o\i) cx p(i\Q,e)p{e\n)p{Q) 



-u(n,e) 



vhere 



U {fl, 6) is an energy (not to be confused with C/ S h ape )- 

We wish to infer the segmentation fl by maximizing 
the posterior probability relative to fl. To this end, we will 
maximize the logarithm of the posterior, or equivalently, 
minimize the energy 



likelihood 



U(fl,9) = -log[p(I\fl,e)p(9\fl)p(fl)} 
= -J2 logpfl(l(s)|0) - £ logJ*(J(s)|0) 



sen 



s£A 



-logX>^(^i) 



Eqs.[3)|4] 

logp(9\fl) + const. 



(7) 



We take an iterative two-step approach to minimizing this 
energy. Given the segmentation estimate i?W i n the n-th 
step, we minimize the energy with respect to 6 [19]. 

Given 0^ n \ we find the optimal by iteratively 

minimizing 



(5) Q{fl\fl^) = - ]T logp n (I(s)|0) - log PA (I(s)\d) 



sen 



seA 



pxv*±M ( 8) 



majorization 

In Eq [HJ we have replaced logarithm- sum term of Eq. [8] 
with a linearized majorization term, by exploiting the con- 
cavity of the logarithm function. The majorization term de- 
fines a surrogate minimization problem that is more easily 
solved - in our case using graph cuts - albeit at the price 
of iteration. Such an algorithm is commonly known as a 
majorization-minimization or MM algorithm 14] (see Ap- 
pendix^. Such an approach is generally possible for other 
energies that contain components of the form 



where the function / : R 
functional dependent on fl. 



is convex, an 



d J 



2.3 Graph cuts for surrogate energy relaxation 

Here, we describe minimization of the surrogate energy 
in Eq [8] using graph cuts. Graph cut methods have their 
grounding in combinatorial optimization theory, and are 
concerned with finding the minimum-cost cut in an undi- 
rected graph. A cut is a partition of a connected graph 
into two disconnected sets. The cost of a cut is the sum 
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of the edge weights along a cut, and a max-flow min-cut 
algorithm finds the cut with the lowest cost. To use graph 
cuts for image segmentation, we must express our energy 
function in terms of edge-weights on a graph. Following El 
Zehiry et al [10], we begin by expressing the energy given 
in Eq [8] as a function of the vertices V and edges £ of a 
graph g = (V,£): 



E 

(s,u)e£ 



U e (B,u). 



In the graph that we construct, each pixel is assigned a 
node, and edges exist between nodes representing neigh- 
boring pixels (Fig [2j. The neighbor edges are known as 
neighbor-links (n-links). Two special nodes called the source 
(foreground) and sink (background) are added, along with 
edges connecting these nodes to each pixel node (Fig [2j. 
These edges are known as terminal-links (t-links). 

We want to infer an unknown two-coloring on the 
nodes of the graph that represents inclusion of a node s 
into either the foreground set S7, or the background set A. 
This inference involves splitting the graph into two parts 
(Fig [2], where a pixel's connection to the source represents 
inclusion into the foreground set. 

Discretizing the shape divergence (Eq.[2| over an eight- 
connected neighbor graph yields 

U a hape(fi , A) = 

indicator of label mismatch 



E 



\ xn ( S ) - XA (s)\ i^( S ')r 



indicator of s.u lying across boundary of Q 

E Kxn(s)(l - Xfl(u)) + Xfl(u)(l - Xn(s))\ 

s.uGN 



S — U 



(9) 



where N refers to the set of tuples of nodes that are neigh- 
boring in space. The distance function <f>\ is taken to be 
the Euclidean distance embedding of A, transformed to 
the center, scale, and orientation of Q. 

Using this expression, it is easy to see that 



and 

U £ (b,u) = 

tt(3 
161 Is — 



. lWk K{Q^),n k ) 



Xo(s) (1 - Xfi(u)) + Xn(u) (1 - xn(a)) 



(11) 



The neighbor energy given in Eq|II]obeys the submodularity 
property, that is, 



U £ (s e 0, u £ Q) + U £ (s g Q, u e f2) 
nf3 JwjKWto,^) fa, 



> U £ (s e n, u e Q) + U £ (s g Q, u £ Q) 
= 0. 

As a result, the energy (Eq[8j is minimizable with graph- 
cuts in polynomial time [2lHI]. 

To embed Uy{s) into the graph, we set the weights of 
the t-links between each pixel to the source to the following 



log^(/(s)|0) 



2 hx T,Li^kK{Q^,Q k ) 



4>o, (s')| 



(12) 



and the weights of the t-links between each pixel to the 
sink to the following 



w(s,v A ) = -logpn(I(s)|0) 



j 

3=1 



Y; J k=1 w k K(f2W,n k ) 



(13) 



The cutting of the edge from an s to v q implies that 
s G A, so it adds to the cost of the cut by the contribution 
of s into the total energy as if s G A. In other words, 
these weights can be interpreted as a pixel's strength of 
belonging to each region. 

To embed Us into the graph, we set the n-links be- 
tween pairwise neighboring pixels s and u to 
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U v (s) = -Xo(s) logp fl (I( B )|0) - (1 - X o(s)) log PA (I(s)\8) 



7T/3 



E— 

3=1 







0flj ( 





J k ^w k K(QM,n k ) 



(14) 



p'wjKjnW^j) |0r2, (s)| A \xo(s)-Xa } (s')\ 
2 f-^ 

3 = 1 



(10) 



Our surrogate energy is now minimized by finding the 
minimum cut of the graph. For details on how to perform 
this optimization, we refer the reader to Boykov and Kol- 
mogorov [2]. 
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Fig. 2 (left) Embedding of segmentation into a graph In the graph-cuts framework, the problem of segmentation is transformed into the 
problem of finding the minimum cut in an edge-weighted graph. The energy to minimize is rewritten as a sum of atomic terms and a sum 
of neighbor-interaction terms. The edges are weighted such that the cut is the energy of the corresponding segmentation, (right) Resulting 
segmentation. The cutting of the graph yields two disconnected node sets where Q is the foreground and A is the background. 



3 Results 

We have implemented our method in Java using the Fiji 
API [20] and tested it on the regularization of luminescence- 
based segmentation of a variety of objects in images (Fig[3j. 
For comparison purposes, we have obtained results of seg- 
mentation using a shape-free global length penalty [10] . 
In these examples, we have used Laplacian statistics for 
p(I\9) and uninformative priors for 6 (by setting p{6) = 1. 
The parameter A was set to 2, and the inverse-temperature 
j3 was set to 200r 2 , except as otherwise noted. 

The first images we processed were low-contrast noisy 
images of a van. A van (Dodge B2500) was driven across 
a bumpy road and videotaped as it entered and exited the 
camcorder's field of view. Snapshots of the recording were 
randomly extracted, showing the van in different regions 



an of the image, under different orientations and difference 
scale factors. We manually constructed a shape prior for 
the van by hand-drawing five templates of a van in many 
poses. Shape-free segmentation of the van captured the 
dorsal aspect of the van, but was inaccurate in delineating 
its ventral features. Using the shape prior, we obtained an 
admissible result that is recognizable as a van. Although 
a van maintains a rigid structure, its shape in images can 
still be considered probabilistic due to uncertainty in pose. 

We also processed pictures of leaves from the Cal- 
tech 101 image databas^] In the given examples, back- 
ground objects interfere with the segmentation. Without 
a shape prior, segmentations based on pixel intensities 



1 http : / /www . vision. caltech. edu/Image_Datasets/ 
leaves/ 



Algorithm 1 MM-Graphcut algorithm 

Obtain initial guess of image intensity model parameters 

Obtain initial guess of the segmentation Q(°>, by seg- 
mentation without a shaper prior. 
Align and rescale the shape prior templates to fi(°>. 
n = 1 

while c7 shape (r2("\ flC"" 1 )) > tol do 

Reweight the edges and perform max flow to estimate 

Q(n+l) 

Re-estimate the image intensity parameters 0^ n+1 ^ (as 
needed) 

Re-align the shape prior templates to the guess 
(as needed) 
n + + 
end while 
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include these background distractors. To construct tem- 
plates, we hand-traced six representative shapes for each 
of the two types of leaves. The regularized segmentations 
that we obtained were able to control for the background 
distractors. 

In Fig^ we segmented a five-pointed leaf using a prior 
composed of both three-tip and five-tip templates, using 
ft = 600t 2 . Although the mean shape of the hybrid-shape 
prior is a leaf with seven tips, the algorithm relaxed into 
the correct five-tip state. 

4 Discussion 

We have presented a method for regularization of segmen- 
tation using a shape prior that is learned through kernel- 
density-estimation. Using a majorization-minimization trick, 
we showed how one can minimize the resulting energy, that 
includes a non-linear kernel-density term, using graph- 
cuts. Although there have been previous studies on in- 
cluding statistical shape priors in graph-cuts segmenta- 
tion [27, 5], these methods have used unimodal shape prior 
distributions where the input shape templates no longer 
retain their a-priori weights. Our shape-prior is a multi- 
modal kernel density estimator like the one used by Cre- 
mers et al [6], thereby enjoying all of the benefits outlined 
in Cremers et al [6]. 

The advantage of our method however is speed. Speed 
has become a big issue with the inception of the "big data" 
world. IT is also an issue in tracking applications, where 
many successive images need to be processed. Using an 
MM algorithm, we find surrogate energy functionals that 
can be minimized quickly using graph cuts. The surrogate 
energy functional (Eq[8j contains an iteratively re-weighted 
shape regularization term, where shapes that are closest to 
the current segmentation estimate are the most influential. 
In our examples, convergence of segmentation typically 
takes place in few (< 10) iterations. 

4.1 Choice of /3-multiplier 

When the number of available templates are large, the 
kernel-density-estimated shape prior is a good representa- 
tion of the shape space of the desired object. In this case, 
ft ~ t 2 can suffice. Yet, even in this case, there are situ- 
ations where it is desirable to increase ft. For example, in 
the event of likelihood mis-specification, increasing ft can 
compensate for a poor likelihood model. 

There are however caveats to increasing ft. The major 
caveat is that if ft becomes too large, the energy functional 
will tend to develop steep local minima about each shape 
template. Such a situation can prevent convergence of the 
algorithm. If a large ft is necessary, then one suggestion is 



to treat it as an annealing parameter and increase it only 
after a few iterations of the algorithm. 

4.2 Computational considerations 

Each iteration of the MM-algorithm is solved quickly using 
graph-cuts. Many efficient algorithms for this optimization 
exist, yielding excellent performance on modern hardware. 
There are many articles discussing the performance ad- 
vantages of graph- cuts [3, 13]. For even more speed, there 
are GPU-accelerated approaches [25] and narrow-banding 
approaches (22J Q7j. 

The minimization of the surrogate energy is not the 
bottleneck of our method. To unleash the full potential of 
this shape regularization method, one needs to optimize 
the processing steps that are in the periphery the graph cut 
optimization - the inference of the image intensity model, 
and the alignment of the templates. 

As far as we know, other methods of shape-regularized 
segmentation also require these processing steps. Fortu- 
nately, these operations can be easily parallelized. Further- 
more, they need-not be repeated in every step of the algo- 
rithm. The image intensity parameters need only change if 
a significant proportion of the labels change. Similarly, the 
templates only need re-alignment if the segmentation has 
significantly changed. To improve performance, one may 
calculate the transformation needed for a single template, 
and use that transformation for the entire set of templates. 
One may also use a fast method such as that of Jiang and 
Tomasi [15] to provide an initial state for shape alignment. 

4.3 Future directions 

The MM algorithm is useful for linearizing objective func- 
tionals. In this paper, we have presented it as a tech- 
nique for linearizing a nonlinear shape prior, yielding a 
linearized surrogate energy functional. It may in general 
be a useful tool for the graph-cut community in order to 
relax other useful nonlinear functionals. Finally, the MM 
and EM algorithms may have use in inference of more- 
robust likelihood models. In our opinion, it is an avenue 
worth investigating. 
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Fig. 3 Shape-regularized segmentation From left to right: original image, length-penalized segmentation, formation of a statistical shape prior 
using a collection of templates, shape-regularized segmentation. Segmentation of the original image of a van without shape information misses the 
wheels. A collection of templates taken together define a probabilistic shape prior. Such representation of shapes is useful since it can account 
for intrinsic variation in a class of shapes, as well as variations that result from changing pose. In these templates, silhouettes of a van are 
taken in different poses. 




Fig. 4 Segmentation using a hybrid shape-prior. Using a com- 
bination of three and five-tip leaves, a "hybrid" shape prior is con- 
structed. Although the average of the shape templates is a leaf with 
seven tips, the kernel-density shape prior model is able to settle into 
a shape with the proper configuration of five tips. 



A Shape alignment 

In order to align the shape templates, we choose the affine 
transformation T(s) = aR(u) (s — c) that minimizes the 
shape divergence energy 



Khape (T(c,UJ,a)) 



mass mismatch 



(xfi(s)-XA(T(s))) |^(T(s)) 



ds 



boundary mismatch 



+ / <5(^(s)) &i(T(s)) ds 



In this expression, we have rewritten the contour integral 
on dft in terms of an integral over the zero-level set of <f>n- 
Here, we develop a local Newton-Raphson algorithm for 
finding the optimal transform. Let us denote the 2 • d col- 
umn vector of transformation parameters tp = [a c u)]. 
We estimate ip using the iterative updates 



tfn+l =<Pn- [H v (¥>„)] 1 V v C/ shap e(v r , 
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where 



and 



hape 



d_ 
da 

da 2 c da "da 
V c — V C VT V^Ve 
d 

v w — v^vj v„v^ 



shape • 



To populate the matrix, we need to calculate the gradients 
with respect to a, c, and u). For A > 0, the first-order 
derivatives take the form 

(xfl(s) ~ Xa(s')) 2 '' + S(M*)) 
(p A (s') 1 sgn(0 yl (s')) ^- VMs^ds 

s -v-' dxl 

pxd 

The sign function comes about from differentiation of the 
absolute value function in the distributional sense. 

For A 7^ 1, the second-order derivatives take the form 

V<p V^.f/shape = 

A(A - 1) / \( Xn (s) - x^(s')) 2 + S (Ms)) 



^,A-2 dT 



\<f>A@T~' 1 ^ L V^(s')V T A (s') |^| ds 



dT 



+ 2A / £(<MO) 



IM^'t'' |^V^(s')V T ^(s') (J^- ) ,/ s 



+ A 



X«(s) - X^(s') ) +ff(^n(s)) 



i^(s')r 



9T 



<9T 



ixpxd 



I'M 8 ') I ~ s g n ^( s ')^f^ — : V^(s')ds. 



If A = 1, the first term is zero and the following term is 
added 



2A 



/ 5 (<Ms')) (xa(s) ~ Xa(s')) sgn(^(s')) 
«V^(s')(^V^(»')) T <i S . 



The contour integrals in these expressions can be calcu- 
lated using a regularized version of the Dirac delta func- 
tion such as 



1 



for small e. 



Similarly, the characteristic function can be interpreted as 
the Heaviside function acting on the signed-distance shape 
embedding, which can be approximated using an approx- 
imation of the Heaviside function such as 

11 x 
H e (x) = — H — arctan — for small e. 
2 7r e 

The nonzero transformation derivatives are as follows 

dT 



aR 



dT 



R(s-c) 



da 

dT 0R, 

d 2 T _ dR 
)u;<9c du> 
d 2 T dR 



dadu: dijj 
d 2 T d 2 R 



dudu) dujdu 

d 2 T 

R. 



(s-c) 

(S-C) 



dadc 



B MM algorithm for iterative graph cuts 

The shape term log Wj K(f2, Qa) can make minimiza- 
tion of the energy difficult, since its formulation involves 
a sum within a logarithm, making the energy functional 
nonlinear with respect to the labeling of pixels (into back- 
ground and foreground) in the image. 

To linearize the shape contribution, we will derive a 
surrogate function with separated terms. A function f(x\xk), 
with fixed Xk, is said to majorize a function g(x) at Xk if 
the following holds [14] 

g(x) < f{x\x k ) 
f(x k ) = g(x k \x k ). 

We wish to perform iterative inference by finding a se- 
quence of segmentations = argmin^ Q{Q\fI^ n '), 
where Q(i7|J?'"') majorizes Eq [7] By the descent prop- 
erty of the MM algorithm [14], this sequence converges to 
a local minimum. 

For any convex function f(x), the following holds [14] 
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Noting that — log(-) is convex, we have 

shape kernel density 



3=1 

'Y, J k=1 w k K{Q^\Q k ) 



x log 



WjK(f2,f2j) 



- V — r^— — log K (fi, (i*) 

+ const 

' V ' 

separated log shape kernel density 

+ const. 



(15) 



Since Eq [15] majorizes the log-kernel density, we can 
minimize our original energy by iteratively minimizing 

Q{Q\Q^) = -J2 logp n (/(s)|0) - ]T \og PA (I( S )\0) 



sen 



seA 



-logp(0\f2) 

Since the distance function can be written as a sum over 
the vertices, so can Eq [16] As a result, it is possible to 
minimize Eqpb] within the graph cuts framework. 
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